Goto

Collaborating Authors

 meteorological parameter


An Open and Large-Scale Dataset for Multi-Modal Climate Change-aware Crop Yield Predictions

Lin, Fudong, Guillot, Kaleb, Crawford, Summer, Zhang, Yihe, Yuan, Xu, Tzeng, Nian-Feng

arXiv.org Artificial Intelligence

Precise crop yield predictions are of national importance for ensuring food security and sustainable agricultural practices. While AI-for-science approaches have exhibited promising achievements in solving many scientific problems such as drug discovery, precipitation nowcasting, etc., the development of deep learning models for predicting crop yields is constantly hindered by the lack of an open and large-scale deep learning-ready dataset with multiple modalities to accommodate sufficient information. To remedy this, we introduce the CropNet dataset, the first terabyte-sized, publicly available, and multi-modal dataset specifically targeting climate change-aware crop yield predictions for the contiguous United States (U.S.) continent at the county level. Our CropNet dataset is composed of three modalities of data, i.e., Sentinel-2 Imagery, WRF-HRRR Computed Dataset, and USDA Crop Dataset, for over 2200 U.S. counties spanning 6 years (2017-2022), expected to facilitate researchers in developing versatile deep learning models for timely and precisely predicting crop yields at the county-level, by accounting for the effects of both short-term growing season weather variations and long-term climate change on crop yields. Besides, we develop the CropNet package, offering three types of APIs, for facilitating researchers in downloading the CropNet data on the fly over the time and region of interest, and flexibly building their deep learning models for accurate crop yield predictions. Extensive experiments have been conducted on our CropNet dataset via employing various types of deep learning solutions, with the results validating the general applicability and the efficacy of the CropNet dataset in climate change-aware crop yield predictions.


Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models

Majumder, Reek, Pollard, Jacquan, Salek, M Sabbir, Werth, David, Comert, Gurcan, Gale, Adrian, Khan, Sakib Mahmud, Darko, Samuel, Chowdhury, Mashrur

arXiv.org Artificial Intelligence

The environmental impacts of global warming driven by methane (CH4) emissions have catalyzed significant research initiatives in developing novel technologies that enable proactive and rapid detection of CH4. Several data-driven machine learning (ML) models were tested to determine how well they identified fugitive CH4 and its related intensity in the affected areas. Various meteorological characteristics, including wind speed, temperature, pressure, relative humidity, water vapor, and heat flux, were included in the simulation. We used the ensemble learning method to determine the best-performing weighted ensemble ML models built upon several weaker lower-layer ML models to (i) detect the presence of CH4 as a classification problem and (ii) predict the intensity of CH4 as a regression problem.


Computational Solar Energy -- Ensemble Learning Methods for Prediction of Solar Power Generation based on Meteorological Parameters in Eastern India

Chakraborty, Debojyoti, Mondal, Jayeeta, Barua, Hrishav Bakul, Bhattacharjee, Ankur

arXiv.org Artificial Intelligence

The challenges in applications of solar energy lies in its intermittency and dependency on meteorological parameters such as; solar radiation, ambient temperature, rainfall, wind-speed etc., and many other physical parameters like dust accumulation etc. Hence, it is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location. Machine learning (ML) models have gained importance and are widely used for prediction of solar power plant performance. In this paper, the impact of weather parameters on solar PV power generation is estimated by several Ensemble ML (EML) models like Bagging, Boosting, Stacking, and Voting for the first time. The performance of chosen ML algorithms is validated by field dataset of a 10kWp solar PV power plant in Eastern India region. Furthermore, a complete test-bed framework has been designed for data mining as well as to select appropriate learning models. It also supports feature selection and reduction for dataset to reduce space and time complexity of the learning models. The results demonstrate greater prediction accuracy of around 96% for Stacking and Voting EML models. The proposed work is a generalized one and can be very useful for predicting the performance of large-scale solar PV power plants also.


Nowcasting lightning occurrence from commonly available meteorological parameters using machine learning techniques

#artificialintelligence

Lightning discharges in the atmosphere owe their existence to the combination of complex dynamic and microphysical processes. Knowledge discovery and data mining methods can be used for seeking characteristics of data and their teleconnections in complex data clusters. We have used machine learning techniques to successfully hindcast nearby and distant lightning hazards by looking at single-site observations of meteorological parameters. We developed a four-parameter model based on four commonly available surface weather variables (air pressure at station level (QFE), air temperature, relative humidity, and wind speed). The produced warnings are validated using the data from lightning location systems.


Modeling Daily Pan Evaporation in Humid Climates Using Gaussian Process Regression

Shabani, Sevda, Samadianfard, Saeed, Sattari, Mohammad Taghi, Shamshirband, Shahab, Mosavi, Amir, Kmet, Tibor, Varkonyi-Koczy, Annamaria R.

arXiv.org Machine Learning

Evaporation is one of the main processes in the hydrological cycle, and it is one of the most critical factors in agricultural, hydrological, and meteorological studies. Due to the interactions of multiple climatic factors, the evaporation is a complex and nonlinear phenomenon; therefore, the data-based methods can be used to have precise estimations of it. In this regard, in the present study, Gaussian Process Regression, Nearest-Neighbor, Random Forest and Support Vector Regression were used to estimate the pan evaporation in the meteorological stations of Golestan Province, Iran. For this purpose, meteorological data including PE, temperature, relative humidity, wind speed and sunny hours collected from the Gonbad-e Kavus, Gorgan and Bandar Torkman stations from 2011 through 2017. The accuracy of the studied methods was determined using the statistical indices of Root Mean Squared Error, correlation coefficient and Mean Absolute Error. Furthermore, the Taylor charts utilized for evaluating the accuracy of the mentioned models. We report that GPR for Gonbad-e Kavus Station with input parameters of T, W and S and GPR for Gorgan and Bandar Torkmen stations with input parameters of T, RH, W, and S had the most accurate performances and proposed for precise estimation of PE. Due to the high rate of evaporation in Iran and the lack of measurement instruments, the findings of the current study indicated that the PE values might be estimated with few easily measured meteorological parameters accurately.


Impacts of Weather Conditions on District Heat System

Xie, Jiyang, Ma, Zhanyu, Guo, Jun

arXiv.org Machine Learning

Using artificial neural network for the prediction of heat demand has attracted more and more attention. Weather conditions, such as ambient temperature, wind speed and direct solar irradiance, have been identified as key input parameters. In order to further improve the model accuracy, it is of great importance to understand the influence of different parameters. Based on an Elman neural network (ENN), this paper investigates the impact of direct solar irradiance and wind speed on predicting the heat demand of a district heating network. Results show that including wind speed can generally result in a lower overall mean absolute percentage error (MAPE) (6.43%) than including direct solar irradiance (6.47%); while including direct solar irradiance can achieve a lower maximum absolute deviation (71.8%) than including wind speed (81.53%). In addition, even though including both wind speed and direct solar irradiance shows the best overall performance (MAPE=6.35%).


SolarisNet: A Deep Regression Network for Solar Radiation Prediction

Dey, Subhadip, Pratiher, Sawon, Banerjee, Saon, Mukherjee, Chanchal Kumar

arXiv.org Machine Learning

Kyoto Protocol (KP) like strategic agreements on energy resources reflects the need for long run forecasting of renewable energy time series fluctuations and mitigate the problems of environment degradation due to emission exhausts from nonrenewable resources [1]. Photovoltaic systems for industrial and domestic uses require the distribution of grid connected power systems with solar radiation as the main energy source. However direct conversion of solar to electrical energy is costly and has relatively low efficiency [2]. Coupled with grid stability issues concerning scheduling and assets optimization for short-term (monthly)and long-term (yearly) forecasting requires guaranteed knowledge of solar radiation instabilities at local weather stations. All this information is based on satellite observations and data from ground stations, with uncertainty in geographic and time availability of data, and data sampling rate posing significant forecast granularity. To assess the PV plant operation dependability on global solar radiation (GSR), good measurement of GSR using a high class radiometer and correct controlling of the instrument through correct maintenance policy is essential.